Querying Uncertain Data in Heterogeneous Databases
نویسندگان
چکیده
In heterogeneous databases the user may issue a query to join two relations in di erent databases on the keys However the keys may be incompatible In this paper we extend our results on probabilis tic query processing to consider joining two relations on incompatible keys A new approach to identify the same entities in di erent relations is proposed Various data and schema con icts such as missing data inconsistent data and domain mismatch are considered in the identi cation process Probabilis tic techniques are used to estimate the sameness of two entities to process queries and to estimate the degree of uncertainty for the query results
منابع مشابه
Extending dynamic queries to handle uncertain data
Dynamic querying is a technique which has been used successfully to enable novice users to gain access to and insight into data in databases. Some multimedia archives (such as archives of African art) contain data which have vague locations in time and space, that is, although there is some idea of when and where the entity originated, the precise information is unknown. This uncertainty create...
متن کاملQuerying Heterogeneous Databases Using Standardized Schemas and SQL
Making databases available for querying both within and across organizations is a vision held by many. Realizing this vision, however, is usually hampered by the existence of heterogeneous database systems, heterogeneous query languages and heterogeneous data semantics. What is needed is a uniform method for accessing these databases. This paper presents a standards based approach in which SQL ...
متن کاملQuerying Heterogeneous Mediated Sources: A Survey
Data integration systems allow access to information in increasingly different forms: relational databases, spreadsheets, web pages, and so on. Querying such heterogeneous sources is challenging due to non-uniform query capability of sources, variety of schema and data models, and limitations on access paths. Most systems use some form of mediation to allow access to heterogeneous sources. Some...
متن کاملQuerying Nested Historical Relations in Heterogeneous Databases Environment
We study schema integration problems for consolidating historical information from nested relational databases in heterogeneous databases environment. These nested relations are for supporting complex objects. In heterogeneous databases systems, probabilistic partial values have been used to resolve some schema integration problems. In this paper, we extend the concept of probabilistic partial ...
متن کاملIndexing the Earth Mover's Distance Using Normal Distributions
Querying uncertain data sets (represented as probability distributions) presents many challenges due to the large amount of data involved and the difficulties comparing uncertainty between distributions. The Earth Mover’s Distance (EMD) has increasingly been employed to compare uncertain data due to its ability to effectively capture the differences between two distributions. Computing the EMD ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1993